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DIFFERENTIALLY EXPRESSED GENES IN PROSTATE CANCER 

\ 

This application claims the benefit of U.S. provisional application number 60/135,325 
filed May 20, 1999, and U.S. provisional application number 60/135,333 filed May 20, 1999, 
both of which are incorporated herein by reference in their entirety. 

Field of The Invention 

The field of the invention is neoplastic diseases, and especially detection and therapy of 
prostate cancer. 

Background of The Invention 

Prostate cancer has become the most commonly diagnosed malignancy in males in the 
western world, and is the second most common cause of cancer death among men in Europe and 
the United States (Boring, C.C., Squires, T.S., and Tong, T. (1993). Cancer J. Clin. 43, 7-26; 
Carter, H.B., Pianpadosi, S., and Isaacs, J.T. (1990). J. Urol. 143 , 742-746). Worse yet, in recent 
years the annual incidence rate of newly diagnosed prostate cancer, as well as the number of 
prostate cancer deaths continuously rose. 

Androgens not only play a key role in the development and maintenance of the normal 
prostate, but also in the initiation and progression of prostate cancer (Moore, R.A. (1944). 
Surgery 16, 152-167; Huggins, C, and Johnson, M.A. (1947). J. Am. Med. Assoc. 135, 1 146- 
1 152). For example, androgens typically induce cell proliferation and inhibit cell death in the 
healthy prostate gland. Withdrawal of androgens stops proliferation of cells and induces apop- 
tosis with concomitant involution of the prostate gland. Involution upon androgen withdrawal is 
generally a characteristic of a normal prostate gland as well as of a prostate tumor in the early 
stages of the disease, when the tumor still remains androgen dependent. Consequently, androgen 
withdrawal treatment is commonly used to reverse tumor growth. However, in the case of many 
prostate tumors, the tumor recurs af\er a few months or years almost invariably in an androgen 
insensitive state. At this point, successful therapy of prostate cancer is difficult and prognosis for 
survival is usually relatively low. 

Almost all of the biological effects of androgens are mediated by the androgen receptor, a 
hormone-activated transcription factor. Even though the androgen receptor was cloned over ten 
years ago, the mechanisms by which the androgen receptor regulates gene expression is not well- 
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understood. Furthermore, only a very few of its target genes have been identified, including the 
prostate specific antigen (PSA), the related glandular kallikrein 2 (hKLK2), and the androgen 
receptor itself. Another androgen regulated protein, a secreted serine protease with prostate 
restricted expression, termed "prostase" was recently described by Nelson et al. (Nelson, P.S., et 
al. (1999) Proc. Natl Acad. Set 96, 31 14-31 19). However, the biological functions of the 
proteins coded by the PSA, hKLK2 and prostase genes are poorly understood at present. 

Adding to the difficulties in understanding the role of androgens in prostate cancer is that 
in vivo and in vitro model systems frequently do not closely mimic the human disease. 
Furthermore, close homologues of the PSA gene, the only marker for prostate cancer in human, 
are not known in other animal species. Moreover, in vitro studies are hampered due to the 
limited number of cell lines that have been derived from human prostate. For example, the only 
androgen sensitive and androgen responsive cell line that is widely used is LNCaP, characterized 
by cells originally derived from a lymph node metastasis of a human prostate carcinoma 
(Horoszewicz, J.S., Leong, S.S., Kawinski, E., Karr, J.P., Rosenthal, H., Chu, T.M., Mirand, 
E.A., and Murphy, G.P. (1983). Cancer Res. 43, 1809-1818). 

In spite of numerous studies on the effects of androgens in the role of prostate cancer, 
relatively little is known about the molecular genetic effects of androgens in prostate cells. More 
detailed knowledge about androgen responsive genes and their role in signal transduction, as 
well as gross morphological and physiological transformations will potentially result in better 
diagnostic tools, and provide possible new targets for a drug-based therapy of prostate cancer. 
Therefore, there is still a need to provide improved methods to identify androgen responsive, 
differentially expressed genes in prostate cancer. 

Summary of the Invention 

The present invention is directed to differentially expressed genes in neoplastic cells, and 
particularly relates to hormone dependent genes in prostate cancer. The polynucleotides with the 
SEQ ID NO:l - SEQ ID NO:7 encode an intracellular protein, and while the corresponding poly- 
peptides SEQ ID NO: 1 1 - SEQ ID NO: 14 have an intracellular location, the corresponding ex- 
pression products SEQ ID NO:8 - SEQ ID NO: 10 have predominantly perinuclear, nuclear and 
predominantly nuclear localization within a cell, respectively. 
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In one aspect of the inventive subject matter, SEQ ID NO:l - SEQ ID NO: 7 are expres- 
sed in prostate cancer cells in a hormone dependent manner. 

In another aspect of the inventive subject matter, a method of detecting a neoplastic cell 
in a system includes a step in which a predetermined amount of an RNA comprising at least one 
5 of SEQ ID NO: 1 5 - SEQ ID NO:21 is correlated with the presence of a neoplastic cell, and the 
predetermined amount, or more, is subsequently detected in the system. Contemplated detection 
methods preferably employ a labeled probe that is detectable via fluorescence detection, 
luminescence detection, scintigraphy, autoradiography, or formation of a dye. Alternative 
preferred detection methods include addition of at least one nucleotide to the probe (e.g., PCR, 
lp^ LCR). 

% 4 In a further aspect of the inventive subject matter, a method of detecting a neoplastic cell 

jTj in a system includes a step in which a predetermined amount of a polypeptide comprising at least 

one of SEQ ID NO:8 - SEQ ID NO: 14 is correlated with the presence of a neoplastic cell, and 
fU the predetermined amount, or more, is subsequently detected in the system. Contemplated detec- 
ts^ tion methods preferably employ a labeled probe that is detectable via fluorescence detection, 
f ~ luminescence detection, scintigraphy, autoradiography, or formation of a dye, and preferred 
p probes include antibodies, antibody fragments, and natural and synthetic ligands of the polypep- 

2 tide - 

In a still further aspect of the inventive subject matter, a method of identifying differenti- 
20 ally expressed genes in a target tissue has one step in which a target tissue-specific cDNA library 
is prepared by suppression subtractive hybridization, and a plurality of genes from the library is 
immobilized on a solid phase. Nucleic acid preparations from treated and untreated target tissue 
are individually hybridized with array, respectively, and hybridization patterns are compared to 
identify differentially expressed genes. 

25 Various objects, features, aspects and advantages of the present invention will become 

more apparent from the following detailed description of preferred embodiments of the inven- 
tion, along with the accompanying drawing. 
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Brief Description of The Drawing 

Figures 1 A and IB depict schematic nucleic acid and amino acid based multiple 
sequence alignments of SEQ ID NO:l - SEQ ID NO:7 and SEQ ID NO:8 - SEQ ID NO:13, 
respectively. 

5 Figure 2 is a photograph of an exemplary reverse northern blot of several clones from a 

cDNA library of androgen treated prostate cancer cells. 

Figure 3 is a photograph of an exemplary multiple tissue northern blot of one of the 
isolated polynucleotides. 

jj=! Figure 4 is a photograph of an agarose gel after electrophoretic separation of splicing 

ICS variants of SEQ ID NO: 1 . 

m Figure 5 is a series of photomicrographs illustrating the intracellular localization of GFP- 

-J fusion proteins of polypeptides of SEQ ID NO:8 - SEQ ID NO: 1 0. 

14= Figure 6 is an autoradiograph of a northern blot indicating hormone dependent 

lZ expression of SEQ ID NO:l - SEQ ID NO:7. 

IS Detailed Description 

As used herein, the term "intracellular protein" refers to a protein that is expressed and 
retained within a cell irrespective of its subcellular localization. For example, DNA polymerase 
(nucleus), glyceraldehyde-3-phosphate dehydrogenase (cytoplasm), and cytochrome c oxidase 
(mitochondria) are all considered intracellular proteins. In contrast, the term "extracellular 
20 protein" refers to a protein that is exported from a cell. There are various mechanisms by which a 
protein can be exported from a cell, all of which are contemplated herein. For example, while 
many exported proteins have a signal sequence that allows specific export of the protein across a 
cell membrane, other proteins are exported through vesicles via the endoplasmatic reticulum, etc. 

As also used herein, a protein has a "predominantly perinuclear localization" when a 
25 majority of the protein (i.e., more than 50% of the total amount as fluorimetrically detectable by 
GFP-fusion) is located in a volume around the nucleus that does not exceed a volume greater 
than three volumes of the nucleus. A protein has a "predominantly nuclear localization" when a 
majority of the protein is located within the nucleus. In contrast, the term "nuclear localization" 
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means that substantially all of the detectable protein is located within the nucleus. Localization 

c 

of a protein can be determined fluorimetrically by GFP-fusion (GFP: green fluorescence 
protein). 

Employing a novel combined SSH-reverse northern blot procedure (first disclosed in 
provisional application 60/135,325 filed May 20, 1999), we isolated seven cDNAs (SEQ ID 
NO:l - SEQ ID NO:7) from human prostate cancer cells. All seven cDNA molecules code for 
an intracellular protein, which is expressed in an androgen and glucocorticoid dependent manner. 
Remarkably, the sequences SEQ ID NO:l - SEQ ID NO:7 show a high degree of 
homology/identity with prostase, a previously reported serine protease (Nelson, P.S., et al. 
(1999) Proc. Natl Acad. Set 96, 3114-31 19). However, in comparison with the prostase, all of 
the sequences SEQ ID NO:l - SEQ ID NO:7 lack a 49 amino acid N-terminal portion correspon- 
ding to the first exon of the prostase. It should be especially appreciated that the first exon of the 
prostase not only includes structurally important amino acids, but also includes a signal peptide 
sequence that renders the serine protease a secreted, extracellular enzyme. Consequently, the 
cDNA molecules with the sequence of SEQ ID NO: 1 - SEQ ID NO:7 encode intracellular 
proteins that are functionally and structurally different from the prostase, and are therefore inde- 
pendent and novel genes. 

With respect to the base composition of SEQ ID NO:l - SEQ ID NO:7, nucleotide se- 
quences other that SEQ ID NO:l - SEQ ID NO:7 are also contemplated and particularly include 
variations of SEQ ID NO:l - SEQ ID NO: 7 that include point mutations, insertions, deletions 
and any reasonable combination thereof, so long as alternative sequences encode an intracellular 
protein. Therefore, polynucleotides are contemplated that have at least 80%, preferably at least 
85%, more preferably at least 90% and most preferably at least 95% identity with the sequences 
of SEQ IDNO:l - SEQ IDNO:7, so long as contemplated polynucleotides encode an intracellular 
protein. 

For example, point mutations may arise at any position of the sequence from an apurinic, 
apyrimidinic, or otherwise structurally impaired site within the cDNA. Alternatively, point 
mutations may be introduced by random or site-directed mutagenesis procedures {e.g., oligonu- 
cleotide assisted or by error prone PCR). Likewise, deletions and/or insertions may be 
introduced into the sequences, and particularly preferred insertions comprise 5 f - and/or 3 '-fusions 
with a polynucleotide that encodes a reporter moiety or an affinity moiety. Other particularly 
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preferred insertions comprise a nucleic acid that further includes functional elements such as a 
promoter, enhancer, hormone responsive element, origin of replication, transcription and 
translation initiation sites, etc. It should especially be appreciated that where insertions with one 
or more functional elements are present, the resulting nucleic acid may be linear or circular (e.g., 
5 transcription or expression cassettes, plasmids, etc.). 

Still further contemplated variations include substitution of one or more atoms or chemi- 
cal groups in the sequence with a radioactive atom or group. For example, where contemplated 
cDNAs are employed as a hybridization-specific probes, a fluorophor or enzyme (e.g., p-galacto- 
sidase for generation of a dye, or luciferase for generation of luminescence) may be coupled to 

ID the sequence to identify position and/or quantity of a complementary sequence. Alternatively, 
where contemplated cDNA molecules are utilized for affinity isolation procedures, the cDNA 

a g may be coupled to a molecule that is known to have a high-affinity (i.e., K d <10" ) mol" 1 ) partner, 

r . 5 

;!f such as biotin, or an oligo-histidyl tag. In another example, one or more phosphate groups may 
CO be exchanged for a radioactive phosphate group with a 32 P or 33 P isotope to assist in detection 

£ S 3 

J3 and quantification, where the radiolabeled cDNA is employed as a hybridization probe. 

r~ It is also contemplated that the polypeptides (encoded by SEQ ID NO: 1 - SEQ ID NO:7) 

□ having the peptide sequence of SEQ ID NO: 8 - SEQ ID NO: 14 may be produced in vivo or in 
12 vitro, and may be chemically and/or enzymatically modified. Contemplated polypeptides can be 
isolated from prostate tissue or prostate cancer cells that may or may not be in a honnone depen- 
20 dent state. Alternatively, and especially where larger amounts (z.e., >10mg) are desirable, recom- 
binant production (e.g., in a bacterial, yeast, insect cell, or mammalian cell system) may advanta- 
geously be employed to generate significant quantities of contemplated polypeptides. 

It should further be appreciated that recombinant production not only offers a more eco- 
nomical strategy to produce contemplated polypeptides, but also allows specific modification in 

25 the amino acid sequence and comppsition to tailor particular biochemical, catalytic and physical 
properties. For example, where increased solubility of contemplated polypeptides is desirable, 
one or more hydrophobic amino acids may be replaced with hydrophilic amino acids. Alternati- 
vely, where reduced or increased catalytic activity is required, one or more amino acids may be 
replaced or eliminated. In still another example, fusion proteins with contemplated proteins are 

30 contemplated, in which an additional polypeptide is added to the N-terminus and/or C-terminus 
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of contemplated polypeptide. Particularly contemplated fusion proteins include fusions with en- 
zymatically active fusion partners (e.g., for dye formation or substrate conversion) and fluores- 
cent fusion partners such as GFP, EGFB, BFP, etc. Therefore, sequences other than the sequen- 
ces of SEQ ID NO:8 - SEQ ID NO:14 are also contemplated, so long as the polypeptides are 
intracellular proteins, and alternative peptide sequences may have a sequence that has a 70%, 
preferably 80%, more preferably 90%, and most preferably 95% homology to the sequences of 
SEQ ID NO:8 - SEQ ID NO: 14. 

With respect to chemical and enzymatic modifications of contemplated polypeptides, it is 
contemplated that many modifications are appropriate, including addition of mono-, and Afunc- 
tional linkers, coupling with protein- and non-protein macromolecules, and glycosylation. For 
example, mono- and Afunctional linkers are especially advantageous where contemplated poly- 
peptides are immobilized to a solid support, or covalently coupled to a molecule that enhances 
immunogenicity of contemplated polypeptides (e.g., KLH, or BSA conjugation). Alternatively, 
contemplated polypeptides may be coupled to antibodies or antibody fragments to allow rapid 
retrieval of the polypeptide from a mixture of molecules. Further contemplated couplings include 
covalent and non-covalent coupling of contemplated polypeptides with molecules that prolong 
the serum half-life and/or reduce immunogenicity such as cyclodextranes and polyethylene 
glycols. 

In a particularly contemplated aspect of the inventive subject matter, SEQ ID NO: 15 - 
SEQ ED NO:21 (the corresponding mRNA of SEQ ID NO: 1 - SEQ ID NO:7) are employed in a 
method of detecting a neoplastic cell in a system. In one step, a predetermined quantity of an 
RNA comprising at least one of SEQ ID NO:15 - SEQ ID NO:21 in a cell containing system is 
correlated with the presence of a neoplastic cell, wherein the RNA encodes an intracellular 
polypeptide, and in a further step, an amount of at least the predetermined quantity of the RNA is 
detected in the system. 

In a preferred embodiment, the system iAa mammal (most preferably a human) and the 
neoplastic cell is a prostate cancer cell in a biops^specimen. The total RNA is extracted from the 
biopsy specimen, and a real time quantitative rt-POR employing individual reactions with primer 
pairs specific to each of the sequences of SEQ ID NO: 15 - SEQ ID NO:21 is performed in 
parallel with a biopsy specimen known to be free of iancer cells. Biopsy specimens are deter- 
mined to have a cancer cell, where the detected mRNA quantity of SEQ ID NO: 15 - SEQ ID 
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[£ \ NO:21 is at least 3 times higher than in the conirol specimen. A preferred extraction of total 

RNA utilizes the Quiagen BioRobot kit in conjunction with the BioRobot 9600 system, and the 
f real time rtPCR is performed in a Perkin Elmer ABI Prism 7700. 

In alternative aspects of the inventive subject matter, the method of detecting a neoplastic 
5 cell need not be limited to biopsy tissues from prostate tissue, but may employ various alterna- 
tive tissues, including lymphoma tumor cells, and various solid tumor cells, so long as such 
tumor cells overproduce mRNA of the SEQ ID NO: 15 - SEQ ID NO:21 . Appropriate alternative 
tumor cells can readily be identified by the above described method. Likewise, the system need 
not be restricted to a mammal, but may also include cell-, and tissue cultures grown in vitro, and 
HI tumor cells and specimens from animals other than mammals. 

s 4 For example, tumor cell and tissue grown in vitro may advantageously be utilized to 

investigate drug action on such cells, and the overabundance of sequences of SEQ ID NO: 15 - 
^ SEQ ID NO:21 may conveniently be employed as tumor marker. Alternatively, body fluids (e.g., 
FU serum, saliva, etc.) that may or may not contain tumor cells are also contemplated a suitable 
f & substrate for the method presented herein, so long as they contain to at least some extent mRNA 
\ "2 with a sequence of SEQ ID NO: 1 5 - SEQ ID NO:21 . 

C3 With respect to the detection method it is contemplated that many methods other than 

quantitative real time rt-PCR are also appropriate, and particularly contemplated methods in- 
clude hybridization of a probe to at least one of SEQ ID NO: 15 - SEQ ED NO:21. It is especially 

20 contemplated that suitable probes are labeled, and depending on the physico-chemical nature of 
the probe, the detection process may include fluorescence detection, luminescence detection, 
scintigraphy, autoradiography, and formation of a dye. For example, for microscopic analysis of 
biopsy specimens, fluorescein modified sequence probes (complementary to at least one of SEQ 
ID NO:15 - SEQ ID NO:21) are particularly advantageous. Fluorescence quantification may then 

25 be performed utilizing a CCD-video .analysis package. Similarly, luminescence may be detected 
with a luminometer coupled to a microscope, or where tissue pieces are submerged in a sample 
cuvette, luminescence may be determined in the sample fluid. It should be appreciated that 
labeling of oligonucleotides and hybridization of the labeled oligonucleotide is a technique that 
is well known in the art, and that all known methods are generally suitable for use in conjunction 

30 with methods contemplated herein. Alternatively, the amount of mRNA may also be determined 
by first hybridizing a probe to the mRNA and subsequently enzymatically coupling of at least 
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one nucleotide to the probe, and especially contemplated enzymatic additions include LCR and 
PCR. 

In still other aspects of contemplated methods, the mRNA quantity need not necessarily 
be limited to at least 3 times more than in the control specimen in order to establish that the 
tissue has a cancer cell. For example, where the concentration of mRNA is hormone dependent, 
higher amounts between 3-8 fold and more may be appropriate. In contrast, where the concen- 
tration of cancer cells in the biopsy specimen is relatively low, amounts of less than 3-fold, in- 
cluding 1.5 to 2.9-fold and less are contemplated. 

In another particularly contemplated aspect of the inventive subject matter, polypeptide 
of SEQ ID NO: 8 - SEQ ID NO: 14 (encoded by SEQ ID NO: 1 - SEQ ID NO:7) are employed 
in a method of detecting a neoplastic cell in a system. In one step, a predetermined quantity of an 
intracellular polypeptide comprising at least one of SEQ ID NO:8 - SEQ ID NO: 14 in a cell 
containing system is correlated with the presence of a neoplastic cell, and in a further step, an 
amount of at least the predetermined quantity of the RNA is then detected in the system. 

In a preferred embodiment, the system is a mammal (most preferably a human) and the 
neoplastic cell is a prostate cancer cell or a breast cancer cell in a biopsy specimen. The biopsy 
specimen that is suspected to have a cancer cell is flash frozen, dissected on a microtome, and 
sections are mounted on microscope slides. The sections are subsequently incubated with a 
fluorescein labeled antibody that is directed against an epitope of at least one of the polypeptides 
of SEQ ID NO:8 - SEQ ID NO: 14. Fluorescence is detected with a fluorescence microscope 
coupled to a CCD-video camera and image analysis equipment. Biopsy specimens are deter- 
mined to have a cancer cell, where the fluorescence signal/quantity of one or more cells is at 
least 3 times higher than in the control specimen. 

In alternative aspects of the inventive subject matter, the method of detecting a neoplastic 
cell need not be limited to biopsy tissues from prostate tissue, but may employ various alterna- 
tive tissues, including lymphoma tumor cells, and various solid tumor cells, so long as such 
tumor cells overproduce polypeptides of the SEQ ID NO:8 - SEQ ID NO: 14. Appropriate alter- 
native tumor cells can readily be identified by the above described method. Likewise, the system 
need not be restricted to a mammal, but may also include cell, and tissue cultures grown in vitro, 
and tumor cells and specimens from animals other than mammals. For example, tumor cell and 
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tissue grown in vitro may advantageously be utilized to investigate drug action on such cells, and 
the polypeptides of SEQ ID NO:8 - SEQ ID NO: 14 may conveniently be employed as a tumor 
marker. Alternatively, body fluids (e.g., serum, saliva, etc.) that may or may not contain tumor 
cells are also contemplated as suitable substrates for the method presented herein, so long as they 
contain to at least some extent the polypeptides of SEQ ID NO:8 - SEQ ID NO: 14. 

With respect to detection methods, it is contemplated that many methods other than 
fluorescence microscopy are also appropriate, and particularly contemplated methods include 
specific binding of a probe to at least one of SEQ ID NO:8 - SEQ ID NO: 14. It is especially 
contemplated that suitable probes are labeled, and depending on the physico-chemical nature of 
the probe, the detection process may include fluorescence detection, luminescence detection, 
scintigraphy, autoradiography, and formation of a dye. 

For example, for microscopic analysis of biopsy specimens, luciferase labeled probes are 
particularly advantageous in conjunction with a luminescence substrate (e.g., luciferin). Lumi- 
nescence quantification may then be performed utilizing a CCD-camera and image analysis sys- 
tem. Similarly, radioactivity may be detected via autoradiographic or scintigraphic procedures on 
a tissue section, in a fluid or on a solid support. Where the probe is a natural or synthetic ligand 
of contemplated polypeptides, particularly contemplated ligands include molecules with a 
chemical modification that increase the affinity to the polypeptide and/or induce irreversible 
binding to the polypeptide. For example, transition state analogs or suicide inhibitors for a 
particular reaction catalyzed by the polypeptide are especially contemplated. Labeling of 
antibodies, antibody fragments, small molecules, and binding of the labeled entity is a technique 
that is well known in the art, and it is contemplated that all known methods are generally suitable 
for use in conjunction with methods contemplated herein. Furthermore, the probe need not be 
limhed to a fluorescein labeled antibody, and alternative probes include antibody fragments (e.g., 
Fab, Fab', scFab, etc.). 

In still other aspects of contemplated methods, the polypeptide quantity need not neces- 
sarily be limited to at least 3 times more than the control specimen in order to establish that the 
tissue has a cancer cell (e.g., where the control reads lOOng, three times more than the control 
means 300ng). For example, where the concentration of the polypeptide is hormone dependent, 
higher amounts between 3-8 fold and more may be appropriate. In contrast, where the concen- 
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tration of cancer cells in the biopsy specimen is relatively low, amounts of less than 3-fold, in- 
cluding 1.5 to 2.9-fold and less are contemplated. 

It should further be appreciated that the polynucleotides of SEQ ID NO: 1- SEQ ID NO:7 
and SEQ ID NO: 1 5 - SEQ ID NO:2 1 may be employed as a therapeutic modality in an antisense 
DNA/RNA based therapy. Anti-sense therapy, for example, could be employed to inhibit, up-, or 
down-regulate transcription or translation of the genes corresponding to SEQ ID NO:l- SEQ ID 
NO:7. It should further be appreciated that an anti-sense approach may also include regulatory 
sequences associated with SEQ ID NO:l- SEQ ID NO:7 such as transcription enhancers, hor- 
mone responsive elements, ribosomal- and RNA polymerase binding sites, etc., which may be 
located upstream or downstream of SEQ ID NO: 1- SEQ ID NO:7, and may have a distance of 
several ten base pairs to several ten thousand base pairs. 

Alternatively, the polypeptides of SEQ ID NO:8- SEQ ID NO: 14 may also be employed 
in an antibody based therapy or a small molecule drug therapy directed towards the polypeptides 
of SEQ ID NO:8- SEQ ID NO: 14. For example, antibody based therapy could be employed to 
neutralize, or remove corresponding polypeptides of SEQ ID NO:8- SEQ ID NO: 14 in-vivo 9 or 
to interfere with one or more cellular functions of contemplated polypeptides. 

Figures 1 A and IB show a schematic and an amino acid based alignment between the 
cDNAs of SEQ ID NO:l - SEQ ID NO:7, in which SEQ ID NO:l is the full-length cDNA and 
SEQ ID N02 - SEQ ID NO:7 are splicing variants of SEQ ID NO:l. In the amino acid based 
alignment SEQ ID NO:8 is the corresponding polypeptide to SEQ ID NO:l, and SEQ ID NO:9 - 
SEQ ID NO: 14 are the corresponding polypeptides to SEQ ID NO:2 - SEQ ID NO:6. 

Examples 

The following examples illustrate the isolation and cloning of the sequences of SEQ ID 
NO: 1 - SEQ ID NO:7 form normal prostate tissue and from prostate cancer cells. SEQ ID NO:8 - 
SEQ ID NO: 14, and SEQ ID N015 - SEQ ID NO:21 are computer generated transcriptions and 
translations of SEQ ID NOl - SEQ ID NO:7, respectively. 

The following examples also illustrate a general method of identifying differentially ex- 
pressed genes in a target tissue, in which in one step a target tissue-specific cDNA library is 
provided that has a plurality of tissue-specific genes obtained by suppression subtractive 
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hybridization. In a subsequent step, a predetermined quantity of tissue-specific genes is 
immobilized on a solid phase to form a tissue-specific cDNA array, and a first nucleic acid 
preparation is hybridized to a first tissue-specific cDNA array to create a first hybridization 
pattern, wherein the first preparation is prepared from the target tissue without previously 
exposing the target tissue to a compound. In a further step, a second nucleic acid preparation is 
hybridized to a second tissue-specific cDNA array to create a second hybridization pattern, 
wherein the second preparation is prepared from the target tissue after previously exposing the 
target tissue to a compound. In yet a further step, the first and the second hybridization pattern 
are then compared to identify differentially expressed genes. This general method is especially 
contemplated where the compound comprises a hormone, or various other suitable ligands. 

Su ppression Subtraction of Prostate Specific Genes 

cDNA derived from poly(A)+ RNA of 10 different normal human tissues were subtrac- 
ted against normal human prostate cDNA using suppression subtraction hybridization (SSH) 
(Diatchenko, L., Lau, Y.-F- C, Campbell! A.P., Chenchik, A., Moqadam, F., Huang, B., Luky- 
anov, S., Lukyanov, K., Gurskaya, N., SveVdlov, E.D., Siebert, P.D. (1996). Proa Natl Acad. 
Sci. USA 93, 6025-6030), and the resultingVsDNA fragments were cloned into an appropriate 
vector. SSH was performed as described (ClWech PCR-Select Cloning Kit) using prostate 
pc >y(A)+ RNA against a pool of poly(A)+ RNA obtained from ten normal human tissues (heart, 
brain, placenta, lung, liver, skeletal muscle, kidney, spleen, thymus, and ovary). Upon secondary 
PCR amplification (12 cycles), reactions were extracted with phenol/chloroform and DNA was 
precipitated with EtOH. \ 

The pellet was washed once with 70% EtQH. After drying, the DNA pellet was dissolved 
in 0.2xTE or dH 2 0 and cut with Rsal in a 20 ul reaction for 2 hrs at 37C to excise adaptors. After 
digestion, reactions were run on a 1.5 % agarose geK with molecular size markers on one side, at 
5 V/cm, 40 min. The adopter bands ate excised and aiscarded, and cDNA bands were cut out and 
purified (QAIEX gel DNA purification kit) after runnjbg the gel backwards to concentrate the 
cDNA. ; V 

The purified DNA was subcloned into EcoRV-cut, dephosphorylated pZERO vector 
from Invitrogen. DH10B electrocompetent cells (>10 10 efficiency) were transformed with a 1/5 
dilution of l |il of the ligation mix. \ 
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Colonies were picked and the presence of cDNA inserts confirmed by PCR with T7 and 
SP6 primers directly from the colonies. 10% of reactions were run on a 1.5% agarose gel to 
visualize amplified products. The colonies with inserts were grown and glycerol stocks (15%) 
were prepared and stored at -80C. 

5 

Reverse Northern Blot and Sequence Analyses 

Clones from the SSH library were amplified by PCR and spotted on nylon filters in 96- 
well format to generate two identical blots for each set of 92 clones (the remaining four spots 
were used for positive and negative controls). For probe preparation, the androgen-responsive 
10 ^ prostate cancer cell line LNCaP (Horoszewicz, J.S., Leong, S.S., Kawinski, E., Karr, J.P., 
2 Rosenthal, H., Chu, T.M., Mirand, E.A., and Murphy, G.P. (1983). Cancer Res. 43, 1809-1818) 
J was employed that was either untreated [the (-) probe] or treated with the synthetic androgen 
W R1881 for 24 hours [the (+) probe], Poly(A)+ RNA was isolated from these cells and was used 
r q to make the 32 P-labeIed probes. After hybridization with the (-) and (+) probes, clones showing 
15 ^ differential hybridization were selected for further analysis {i.e., confirmation by a secondary 
£3 reverse northern blot, and northern blotting). 

Reverse northern screening on the cDNA clones were done essentially as described else- 
Ji? where (Hedrick, S.M., Cohen, D.L, Neilson, E.A., Davis, M.M. (1984) Nature 308, 149-153; 
M Sakaguchi N, Berger CN, Melchers F (1986). EMBOJ5: 2139-2147). DNA (approximately 400 
20 ng) from PCR amplification in step 6 was diluted in 200 fxl of 0.4M NaOH, 10 mM EDTA and 
mixed well by pipetting. After incubation at 95°C for 5-10 min, the tubes were chilled on ice. 
Denatured DNA was blotted on two separate pieces of Zeta Probe GT+ membrane (Bio-Rad) 
using a dot-blot apparatus (Bio-Rad). Positive [Prostate specific antigen (PSA) cDNA] and 
negative [glyceraldehyde 3-phosphate dehydrogenase (G3PDH) cDNA] controls are included on 
25 each blot in duplicate. Membranes were rinsed with 2XSSC, air dried, and then baked at 80°C 
for 30 min. A typical example of a reverse northern analysis is shown in Figure 2. In each blot 
pair, PSA and G3PDH are included as positive and negative controls, respectively. It should be 
noted that there was substantial increase in PSA hybridization in the (+) blot (probe prepared 
from cells that have been stimulated by androgens) compared with the (-) blot (probe prepared 
30 from untreated cells), whereas there was no significant change in hybridization of G3PDH 
between the two blots. Arrowheads indicate differentially expressed clones. 
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As a control to verify the prostate specific nature of isolated sequences, positive clones 
were tested in a standard northern blot against RNA preparations of multiple non-prostate tissues 
and a typical blot is shown in Figure 3. Lanes 1-10, and 12-16 are RNA preparations from non- 
prostate tissues, lane 1 1 is a RNA preparation from prostate, lane 12 is a RNA preparation from 
5 testis. 

Probes: The probes were random-prime radiolabeled using standard laboratory proce- 
dures. Unincorporated nucleotides were removed using prespun G25 columns (Bio-Rad), and 
specific activity was typically over 5xl0 8 cpm/jig. 

Hybridization: 25 ml Hybridization mix (7% SDS, 0.5 M NaHP0 4 , ImM EDTA) at 65°C 
10 ^ is prewarmed, and 12.5 ml were used for prehybridization of each membrane for 5-10 min at 65 
H °C. Probes were heat denatured at 95°C for 3-5 min and transferred to the prehybridization mix 
bj at 65°C. Hybridization was done at 65°C overnight. 

7S? - 

ffs 

;J Washing: Wash solution I (2xSSC and 1% SDS) and wash solution H (O.lxSSC and 

^ 0.5% SDS) were prewarmed. Membranes were washed once with Solution I and then Solution II 

15 m, for 30 min at 65°C, covered with plastic wrap and exposed to phoshorimager screen. 

}Z. Selection: Clones showing differential expression between the (-) and (+) blots were 

picked. A secondary round of reverse northern analysis is performed for confirmation by 
spotting each clone in duplicate on each blot. To confirm hormone dependence, a time course of 
R1881 induction of LNCaP cells as well as the CWR22 xenograft model upon androgen ablation 

20 (Wainstein, M. A., He, F., Robinson, D., Kung, H. J., Schwartz, S., Giaconia, J. M., Edgehouse, 
N. L., Pretlow, T. P., Bodner, D. R., Kursh, E. D., andPretlow, T.G. (1994). Cancer Res. 54, 
6049-6052) and the androgen-independent CWR22R relapsed xenograft (Nagabhushan, M., 
Miller, C. M., Pretlow, T. P., Giaconia, J. M. 5 Edgehouse, N. L., Schwartz, S., Kung, H. J., de 
Vere White, R. W., Gumerlock, P. H., Resnick, M. L, Amini, S. B. , and Pretlow, T. G. (1996). 

25 Cancer Res. 56, 3042-6) was used. 

Sequence analysis: Sequence analysis was performed by the dideoxy chain termination 
methods using an ABI automated sequencer. It should be appreciated that many more androgen 
responsive, differentially expressed genes can be identified and isolated using the cloning stra- 
tegy outlined above, including genes expressed during various growth and developmental phases 
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of a diseased prostate, and genes expressed as a result of a drug regimen. Moreover, it is contem- 
plated that not only differentially expressed prostate cancer genes can be identified and isolated, 
but also genes involved in other diseased states of human prostate, including benign prostate 
hyperplasia, etc. 

5 Isolation of Splice Variants (SEP ED NO: 2 - SEP ID NO: 7) 

Poly(A) + RNA was isolated from LNCaP cells treated with R1881 (a synthetic androgen) 
and from androgen dependent prostate cancer xenograft CWR22 grown in nude mice in the 
presence of androgens. cDNA was prepared and subjected to PCR using SEQ ID NO:l specific 
primers and a primer pair designed to amplify the previously published prostase. The respective 
1CH S'-primers were located around the translation start site, while the 3 f -primer for all reactions was 
« located around the stop codon. Reaction products were loaded onto an agarose gel and separated 
=S as shown in Figure 4. Lane 1 is a marker, lane 2 is positive control with SEQ ID NO:l as 
m template. Lanes 3-5 are PCR products from CWR22 cells with SEQ ID NO:l specific primers, 
;~ while lanes 6-8 are PCR products from CWR22 cells with prostase specific 5'-primer. Lanes 9- 
15 1 1 are PCR products from LNCaP cells with SEQ ID NO:l specific primers, and lanes 12-14 are 
12 PCR products from LNCaP cells with prostase specific primers. Lane 15 is marker. 

J3 Only reactions with SEQ ID NO: 1 specific primers yielded detectable PCR products, 

hi with a major band at 680bp (SEQ ID NO: 1), and two additional bands at about 500bp (SEQ ID 
NO:2) and 750bp (SEQ ED NO:3). When primers for 5-RACE analysis were employed, four 
20 additional PCR products were obtained, corresponding to SEQ ID NO:4, SEQ ID NO:5, and 
SEQ ID NO:6, respectively. All bands were sequenced to confirm their identity. SEQ ID NO:7 
was obtained both as a 3 f -RACE product as well as a distinct clone from the PSL library. 

Localization of Intracellular Proteins encoded by SEP ID NO: 1 - SEP ID NO: 3 

To gain insight into the subcellular location of the polypeptides SEQ ID NO:8 - SEQ ID 
25 NO: 10 (encoded by SEQ ID NO:l - SEQ ID NO:3), C-terminal fusion constructs with GFP were 
produced in COS7 cells. The cells were fixed, stained with DAPI and visualized by phase con- 
trast or fluorescence microscopy and representative images are shown in Figure 5. Photographs 
in lane A depict a GFP fusion protein with SEQ ID NO:9, the photographs in lane B depict a 
GFP fusion protein with SEQ ID NO: 10, the photographs in lane C depict a GFP fusion protein 
30 with SEQ ID NO:8, and the photographs in lane D depict a GFP protein as a control. The poly- 
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peptide of SEQ ID NO: 8 displayed strong granular fluorescence predominantly around the 
nucleus, while the polypeptide of SEQ ID NO:9 and SEQ ID NO: 1 0 showed exclusively nuclear 
and predominantly nuclear localization, respectively. Due to the lack of an identifiable leader 
sequence that would indicate an export of the polypeptides of SEQ ID NO:l 1 - SEQ ID NO: 14, 
5 it is contemplated that the sequences SEQ ID NO: 1 1 - SEQ ID NO: 14 are also intracellular 
proteins. 

Regulation of Expression bv multiple Hormones 

Untreated LNCaP cells and hormone treated LNCaP cells were employed to determine 
the hormone dependence of expression of SEQ ID NO:l. Treatment was as follows: Testoster- 
lQ a one (T) at 10* 8 M, dihydrotestosterone (DHT) at 10- 8 M, estradiol (E2) at 10 8 M, progesterone (P) 
j3 at 10" 8 M, dexamethasone (Dex) at 10" 7 M, 1, 25-dihydroxy- vitamin D3 (VitD3) at 10" 8 M, and 
triiodothyronine (T3) at 10* 7 M. The total RNA of the treated cells was isolated and used in a 
m northern blot analysis with radiolabeled SEQ ID NO:l as probe. Figure 6 shows the results of an 
to autoradiograph of a northern blot as described above. 1 8S-RNA is shown as control for RNA 
li integrity and loading. The relative induction of SEQ ID NO:l is indicated at the bottom of the 
12 lanes as determined by phosphorimager analysis. It is contemplated that SEQ ID NO:2 - SEQ ID 
H NO:7 are splice variants of SEQ ID NO: 1, and consequently it is contemplated that the express- 
p ion of all of SEQ ID NO: 1 - SEQ ID NO:6 is hormone dependent, and particularly contemplated 
hormones include androgens, progesterones, estrogens and glucocorticoids. 

20 Thus, specific embodiments and applications of methods and applications of differen- 

tially expressed genes in prostate cancer cells have been disclosed. It should be apparent, 
however, to those skilled in the art that many more modifications besides those already described 
are possible without departing from the inventive concepts herein. The inventive subject matter, 
therefore, is not to be restricted except in the spirit of the appended claims. Moreover, in 

25 interpreting both the specification and the claims, all terms should be interpreted in the broadest 
possible manner consistent with the context. In particular, the terms "comprises" and 
"comprising" should be interpreted as referring to elements, components, or steps in a non- 
exclusive manner, indicating that the referenced elements, components, or steps may be present, 
or utilized, or combined with other elements, components, or steps that are not expressly 

30 referenced. 
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SEQ ID NO:l 
atggaaaacg aattgttctg 
gcacactgtt tccagaactc 
caagagccag ggagccagat 
agacccttgc tcgctaacga 
gacaccatcc ggagcatcag 
gtttctggct ggggtctgct 
gtgtcggtgg tgtctgagga 
atgttctgcg ccggcggagg 
cccctgatct gcaacgggta 
caagttggcg tgccaggtgt 
accgtccagg ccagttaa 

SEQ ID NO: 2 
atggaaaacg aattgttctg 
gcacactgtt tccagaactc 
caagagccag ggagccagat 
agacccttgc tcgctaacga 
gacaccatcc ggagcatcag 
gtttctggct ggggtctgct 
gtacttgcag ggccttgtgt 
tgtctacacc aacctctgca 
a 




gctgtcagcc 60 
tgaggccgac 120 
agagtacaac 180 
gtccgagtct 240 
ctcttgcctc 300 
gtgcgtgaac 3 60 
ccaccccagc 420 
ctctgggggg 480 
cccgtgtggc 54 0 
gatagagaaa 600 
618 




ctggtgcatc 
gggctgggcc 
agcctctccg 
atcaagttgg 
cagtgcccta 
tgactctggg 
agccccgtgt 
gtggatagag 



cgcagtgggt 
tgcacagtct 
tacggcaccc 
acgaatccgt 
ccgcggggaa 
gggcccctga 
ggccaagttg 
aaaaccgtcc 



gctgtcagcc 60 
tgaggccgac 12 0 
agagtacaac 18 0 
gtccgagtct 240 
ctcttgcctc 300 
tctgcaacgg 360 
gcgtgccagg 420 
aggccagtta 4 80 
481 



SEQ ID NO: 3 

atggaaaacg aattgtteftg ctcgggcgtc ctggtgcatc cgcagtgggt gctgtcagcc 60 

gcacactgtt tccagaactc ctacaccatc gggctgggcc tgcacagtct tgaggccgac 120 

caagagccag ggagc/ragat ggtggaggcc agcctctccg tacggcaccc agagtacaac 180 
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agacccttgc tcgctaacga cctcatgctc atcaagttgg acgaatccg^ gtccgagtct 24 0 
gacaccatcc ggagcatcag cattgcttcg cagtgcccta ccgcggggAa ctcttgcctc 3 00 
gtttctggct ggggtctgct ggcgaacggt gagctcacgg gtgtgtofcct gccctcttca 360 
aggaggtcct ctgcccagtc gcgggggctg acccagagct ctgcgt/fccca ggcagaatgc 420 
ctaccgtgct gcagtgcgtg aacgtgtcgg tggtgtctga ggaggtctgc agtaagctct 480 
atgacccgct gtaccacccc agcatgttct gcgccggcgg aggcfcaagac cagaaggact 54 0 
cctgcaacgg tgactctggg ggggccctga tctgcaacgg gtalcttgcag ggccttgtgt 600 
ctttcggaaa agccccgtgt tggccaagtt ggcgtgccag gtigtctacac caacctctgc 660 
aaattcactg agtggataga gaaaaccgtc caggccagtt 702 

SEQ ID NO: 4 

ggaatgagcc tggatccggg gagcccagag ggaagggctg ggaggcggga atcttgcttc 60 
ggaaggactc agagagtcct gacttgaaat ctcag</ccag tgctgagtct ctagtgaact 12 0 
aagctcctac accatcgggc tgggcctgca cagtcttgag gccgaccaag agccagggag 180 
ccagatggtg gaggccagcc tctccgtacg gca!cccagag tacaacagac ccttgctcgc 24 0 
taacgacctc atgctcatca agttggacga atccgtgtcc gagtctgaca ccatccggag 300 
catcagcatt gcttcgcagt gccctaccgc ggggaactct tgcctcgttt ctggctgggg 3 60 
tctgctggcg aacggtgaac tcacgggtgj/gtgtctgccc tcttcaagga ggtcctctgc 420 
ccagtcgcgg gggctgaccc agagctctgc gtcccaggca gaatgcctac cgtgctgcag 4 80 
tgcgtgaacg tgtcggtggt gtctgaggag gtctgcagta agctctatga cccgctgtac 540 
caccccagca tgttctgcgc cggcg/aggg caagaccaga aggactcctg caacggtgac 600 
tctggggggc ccctgatctg caac^ggtac ttgcagggcc ttgtgtcttt cggaaaagcc 660 
ccgtgtggcc aagttggcgt gccaggtgtc tacaccaacc tctgcaaatt cactgagtgg 720 
atagagaaaa ccgtccaggc cagttaactc tggggactgg gaacccatga aattgacccc 780 
caaatacatc ctgcggaagg aattcaggaa tatctgatcc cagcccctcc tccc 834 



SEQ ID NO: 5 / 
ggaatgagcc tggatccggg gagcccagag ggaagggctg ggaggcggga atcttgcttc 60 
ggaaggactc agagagccct gacttgaaat ctcagcccag tgctgagtct ctagtgaact 120 
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aagctcctac accatcgggc tgggcctgca cagtcttgag gccgacfcaag agccagggag 180 

ccagatggtg gaggccagcc tctccgtacg gcacccagag tacaacagac ccttgctcgc 240 

taacgacctc atgctcatca agttggacga atccgtgtcc gagtfctgaca ccatccggag 300 

catcagcatt gcttcgcagt gccctaccgc ggggaactct tgfictcgttt ctggctgggg 360 

tctgctggcg aacggcagaa tgcctaccgt gctgcagtgc gfcgaacgtgt cggtggtgtc 420 
tgaggaggtc tgcagtaagc / 44 0 



SEQ ID NO: 6 

ggctctggga ggaggacgga atgagcctgg atccggg^ag cccagaggga agggctggga 60 
ggcgggaatc ttgcttcgga aggactcaga gagccctgac ttgaaatctc agcccagtgc 120 



tgagtctcta gtgaactaag ctcctacacc atcgggctgg gcctgcacag tcttgaggcc 180 

gaccaagagc cagggagcca gatggtggag gccagcctct ccgtacggca cccagagtac 24 0 

aacagaccct tgctcgctaa cgacctcatg ctcatcaagt tggacgaatc cgtgtccgag 300 

tctgacacca tccggagcat cagcattgct tcgcagtgcc ctaccgcggg gaactcttgc 360 



ctcgtttctg 


gctggggtct 


gctggcgaac 


ggcagaatgc 


ctaccgtgct 


gcagtgcgtg 


420 


aacgtgtcgg 


tggtgtctga 


ggaggtctgc 


^agtaagc 






457 


SEQ ID NO: 7 


/ 










accaccccag 


catgttctgc 


gccggcggjag 


agcaagacca 


gaaggactcc 


tgcaacgtga 


60 


gagaggggaa 


aggggagggc 


aggcgaotca 


gggaagggtg 


gagaaggggg 


agacagagac 


120 


acacagggcc 


gcatggcgag 


atgcagagat 


ggagagacac 


acagggagac 


agtgacaact 


180 


agagagagaa 


actgagagaa 


/ 

acagggaaat 


aaacacagga 


ataaagagaa 


gcaaaggaag 


240 


agagaaacag 


aaacagacat 


/ 

gggggaggca 


gaaacacaca 


cacatagaaa 


tgcagctgac 


300 


cttccaacag 


catggggcct 


i 

gagggcggtg 


acctccaccc 


aacagaaaat 


cctcttataa 


360 


cttttgactc 


cccaaaaaac 


ctgactagaa 


atagcctact 


gttgacgggg 


gagccttacc 


420 


aataacataa 


atagtcgatt , 


'tatgcatacg 


ttttatgcat 


tcatgatata 


cctttgttgg 


480 


aattttttga 


tatttctaag 


ctacacagtt 


cgtctgtgaa 


tttttttaaa 


ttgttgcaac 


540 


tctcctaaaa 


ttttttccaa 


tgtgtttatt 


gaaaaaaatc 


caagtataag 


tggacttgtg 


600 



cagttcaaac cagggttzgtt caagggtcaa ctgtgt 636 



WO 00/71711 



-20- 



PCT/IB00/00673 



SEQ ID NO: 8 
auggaaaacg aauuguucug 
gcacacuguu uccagaacuc 
caagagccag ggagccagau 
agacccuugc ucgcuaacga 
gacaccaucc ggagcaucag 
guuucuggcu ggggucugcu 
gugucggugg ugucugagga 
auguucugcg ccggcggagg 
ccccugaucu gcaacgggua 
caaguuggcg ugccaggugu 
accguccagg ccaguuaa 

SEQ ID NO: 9 
auggaaaacg aauuguucug 
gcacacuguu uccagaacuc 
caagagccag ggagccagau 
agacccuugc ucgcuaacga 
gacaccaucc ggagcaucag 
guuucuggcu ggggucugcu 
guacuugcag ggccuugguc 
gucuacacca accucugcaa 



cucgggcguc 
cuacaccauc 
gguggaggcc 
ccucaugcuc 
cauugcuucg 
ggcgaacggc 
ggucugcagu 
gcaagaccag 
cuugcagggc 
cuacaccaac 



cuggugcauc 
gggcugggcc 
agccucuccg 
aucaaguugc 
cagugcccua 
agaaugccua 
aagcuouaug 
aaggacuccu 
cuugugucuu 
Lcugcaaau 




gcugucagcc 60 
ugaggccgac 12 0 
agaguacaac 180 
guccgagucu 24 0 
cucuugccuc 300 
gugcgugaac 360 
ccaccccagc 420 
cucugggggg 4 80 
cccguguggc 54 0 
gauagagaaa 600 
618 



cuggugcauc 
gggcugggcc 
agccucuccg 
aucaaguugg 
cagugcccua 
ugacucuggg 
gccccgugug 
uggauagaga 



cgcagugggu 
ugcacagucu 
uacggcaccc 
acgaauccgu 
ccgcggggaa 
gggccccuga 
gccaaguugg 
aaaccgucca 



gcugucagcc 60 
ugaggccgac 120 
agaguacaac 180 
guccgagucu 24 0 
cucuugccuc 300 
ucugcaacgg 360 
cgugccaggu 420 
ggccaguuaa 480 



SEQ ID NO: 10 

auggaaaacg aauuguucug cucgggcguc cuggugcauc cgcagugggu gcugucagcc 60 
gcacacuguu uccagaacuc cuacaccauc gggcugggcc ugcacagucu ugaggccgac 120 
caagagccag ggagccagau gguggaggcc agccucuccg uacggcaccc agaguacaac 180 
agacccuugc ucgcuaacga ccucagcuca ucaaguugga cgaauccgug uccgagucug 24 0 
acaccauccg gagcaucagc auugcuucgc agugcccuac cgcggggaac ucuugccucg 3 00 
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KJL l_l w U«j W •>* 3 




tJ » ~J ^* ^* » ^3 3 


** *3 ^3 ^5 


W*^^ ^3 


cccucuucaa 


3 60 








cccaaaacuc 


UGcaucccaa 


^* 


420 


uaccgugcug 


cagugcguga 


acgugucggu 


ggugucugag 


gaggucugca 


guaagcucua 


480 


ugacccgcug 


uaccacccca 


gcauguucug 


cgccggcgga 


gggcaagacc 


agaaggacuc 


540 


cugcaacggu 


gacucugggg 


gggcccugau 


cugcaacggg 


^(acuugcagg 


gccuuguguc 


600 


uuucggaaaa 


gccccguguu 


ggccaaguug 


gcgugccagg/ 


ugucuacacc 


aaccucugca 


660 


aauucacuga 


guggauagag 


aaaaccgucc 


aggccaguua 


a 




701 



SEQ ID NO: 11 



ggaaugagcc 


uggauccggg 


gagcccagag 


ggaaggacug 

/ 


ggaaggacuc 


agagaguccg 


acuugaaauc 


ucagcccagu 


agcuccuaca 


ccaucgggcu 


gggccugcac 


agucuugagg 


cagaggugga 


ggccagccuc 


uccguacggc 


acc/agagua 


acgaccucau 


gcucaucaag 


uuggacgaau 


cciuguccga 

/ 


ucagcauugc 


uucgcagugc 


ccuaccgcgg 


ggaacucuug 


ugcuggcgaa 


cgggaacuca 


cgggugugug 


ucugcccucu 


gucgcggggg 


cugacccaga 


gcucugcguc/ 


Ccaggcagaa 


ugaacguguc 


gguggugucu 


gaggaggucu 

/ 


gcaguaagcu 


ccagcauguu 


cugcgccggc 


ggagggcaag 

/ 

ggguacuugc 

/ 


accagaagga 


gggggccccu 


gaucugcaac 


agggccuugu 


guggccaagu 


uggcgugcca 


ggugucuaca 


ccaaccucug 


agaaaaccgu 


ccaggccagu 


uaacucuggg 

/ 


gacugggaac 


uacauccugc 


ggaaggaauu 


caggaauauc 


ugaucccagc 



830 



SEQ ID NO: 12 

ggaaugagcc uggauccggg gpgcccagag ggaagggcgg gaggcgggaa ucuugcuucg 60 
gaaggacuca gagagcccug acuugaaauc ucagcccagu gcugagucuc uagugaacua 12 0 
agcuccuaca ccacgggcug/ggccugcaca gucuugaggc cgaccaagag ccagggagcc 180 
agauggugga ggccagccuc uccguacggc acccagagua caacagaccc uugcucgcua 240 
acgaccucau gcucaucaag uuggacgaau ccguguccga gucugacacc auccggagca 300 
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ucagcauugc uucgcagugc ccuaccgcgg ggaacucuug ccucgpuucu ggcugggguc 360 
ugcuggcgaa cggcagaaug ccuaccgugc ugcagugcgu gaacgugucg guggugucug 42 0 
aggaggucug caguaagc / 438 

SEQ ID NO: 13 

gcucugggag gaggacggaa ugagccugga uccggggagc dcagagggaa gggcugggag 60 
gcgggaaucu ugcuucggaa ggacucagag agcccugacu /ugaaaucuca gcccagugcu 120 
gagucucuag ugaacuaagc uccuacacca ucgggcuggg ccugcacagu cuugaggccg 180 
accaagagcc agggagccag augguggagg ccagccucuc cguacggcac ccagaguaca 24 0 
acagacccuu gcucgcuaac gaccucaugc caucaaguug gacgaauccg uguccgaguc 3 00 

—5 / 

J ugacaccauc cggagcauca gcauugcuuc gcagugcccu accgcgggga acucuugccu 360 

i J / 

« cguuucuggc uggggucugc uggcgaacgg cagaaugccu accgugcugc agugcgugaa 420 
n cgugucggug gugucugagg aggucugcag uaagc / 455 

'£Q ID NO: 14 

* accaccccag cauguucugc gccggcggag agcaagacca gaaggacucc ugcaacguga 60 
j gagaggggaa aggggagggc aggcgacuca gggaagggug gagaaggggg agacagagac 12 0 
k acacagggcc gcauggcgag augcagagau gcjagagacac acagggagac agugacaacu 18 0 
agagagagaa acugagagaa acagggaaau aaacacagga auaaagagaa gcaaaggaag 24 0 
agagaaacag aaacagacau gggggaggca gaaacacaca cacauagaaa ugcagcugac 3 00 
cuuccaacag cauggggccu gagggcggug/ accuccaccc aacagaaaau ccucuuauaa 360 
cuuuugacuc cccaaaaaac cugacuagaa auagccuacu guugacgggg gagccuuacc 420 
aauaacauaa auagucgauu uaugcauacg uuuuaugcau ucaugauaua ccuuuguugg 480 
aauuuuuuga uauuucuaag cuacacaguu cgucugugaa uuuuuuuaaa uuguugcaac 540 
ucuccuaaaa uuuuuucuaa uguguuuauu gaaaaaaauc caaguaaagu ggacuugugc 600 
aguucaaacc aggguuguuc aagggucaac ugugu 63 5 

SEQ ID NO: 15 

Met Glu Asn Glu Leu Phe <tys Ser Gly Val Leu Val His Pro Gin Trp 
Val Leu Ser Ala Ala His/Cys Phe Gin Asn Ser Tyr Thr lie Gly Leu 
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/ 



Gly 


T A** 

Lieu 


rixS 




Leu 


IjIU 


iila 


Asp 


bin 


W X. LI 


Pro 




' Ser 


VjJ.ll 




Va 1 
vai 


Glu 


Ala 


Ser 


T Ait 

Leu 


Ser 


vai 


Arg 


TT -1 — 

nlS 


Pro 


VjIU 


Tyr 


& eA 
AS.'il 


Arg 


Pro 


Leu 


Leu 


Aid 


Asn 


Asp 


Leu 


Mot- 
ive L- 


Leu 


Tip 

11c 


Lys 


Leu 




LI 


Ser 


Vai 


sex. 




wCX> 


Asp 


Thr 


1 le 


Arg 


Ser 


1X6 




Tip 

i i c 


Ala 
Aid 


O *a t" 


m n J 

U JL XXI 


'Cys 


Pro 


X 11 X. 


Ala 


Gly 


Asn 


Ser 


Cys 


Leu 


vai 


Ser 


vjJLy 


Trp 


m \r 

uiy 


JUC LI. 


x_icax 


Ala 


Asn 


Hi vr 


Arg 


Met 


Pro 


Th t* 


vaJ. 


Leu 


ox.ll 


Cys 


V Cl J. 


Asn 


V & J. 


Ser 


V/a.1 


Vai 


Ser 


Glu 


Glu 


Vai 


Cys 


Ser 


Lys 


Leu 


Tyr 


Asp 


lr X. U 


UC u 


Tvr 

±yx7 


His 


/Pro 


Ser^Met 


Phe 


Cys 


Ala 




Gly 


Gly 


Gin 


Asp 


uin 


Lys 


Asp 


Del 


C\rJ 

<-ys 




Gly Asp 


Car 

OCX, 


Gly Gly 


Fro 


Leu 


He 


Cys 


Asn 


vjiy 


Tyr 


Leu 


oin 


Cjxy 


Leu 


Vai 


Ser 


OVt a 

XT llC 


Gly Lys 


Aia 


Pro 


Cys 


Gly 


oin 


va i 


Lily 


vai 


Pro 


t>iy 


Veil 


Tyr 


Thr 


As n 


Leu 


Cys 


Lys 


Phe 


Thr 


Glu 


Trp 


Tl - 

lie 


blu 


Lys 


lnxy 


Vai 


VjJLII 


Ala 


Ser 










ID NO: 16 


























nec 


Glu 


Asn 


Glu 


Leu 


irne 


Cys 


Ser 


Gly 


Vai 


Leu 


Vai 


His 


Pro 


Gin 


Trp 


vai 


Leu 


Ser 


Ala 


A J. a 


illS 


Cys 




r«lll 


Asn 


Cpr 
OC1 


Tyr 


Thr 


11c 


Gly 


Leu 


Gly 


Leu 


His 


Ser 


Leu 


Glu 


Ala 


Asp 


Gin 


Glu 


Pro 


Gly Ser 


Gin 


Met 


Vai 


Glu 


Ala 


Ser 


Leu 


Ser 


Vai 


Arg 


His 


Pro 


Glu 


Tyr 


Asn 


Arg 


Pro 


Leu 


Leu 


Ala 


Asn 


Asp 


Leu 


Met 


Leu 


He 


£ys 


Leu 


Asp 


Glu 


Ser 


Vai 


Ser 


Glu 


Ser 


Asp 


Thr 


He 


Arg 


Ser 


He 


Ser/ 


He 


Ala 


Ser 


Gin 


Cys 


Pro 


Thr 


Ala 


Gly 


Asn 


Ser 


Cys 


Leu 


Vai 


Ser 


Gly 


Trp 


Gly 


Leu 


Leu 


Ala 


Asn 


Gly 







SEQ 


ID NO: 17 


























Met 


Glu 


Asn 


Glu 


Leu 


Phe' 


Cys 


Ser 


Gly 


Vai 


Leu 


Vai 


His 


Pro 


Gin 


Trp 


Vai 


Leu 


Ser 


Ala 


Ala 


Hi/s 


Cys 


Phe 


Gin 


Asn 


Ser 


Tyr 


Thr 


He 


Gly 


Leu 


Gly 


Leu 


His 


Ser 


Leu 


Gau 


Ala 


Asp 


Gin 


Glu 


Pro 


Gly 


Ser 


Gin 


Met 


Vai 


Glu 


Ala 


Ser 


Leu 


Ser 


Vai 


Arg 


His 


Pro 


Glu 


Tyr 


Asn 


Arg 


Pro 


Leu 


Leu 


Ala 


Asn 


Asp 


Leu 


Met/ 


Leu 


He 


Lys 


Leu 


Asp 


Glu 


Ser 


Vai 


Ser 


Glu 


Ser 


Asp 


Thr 


He 


Arg 


Set 


He 


Ser 


He 


Ala 


Ser 


Gin 


Cys 


Pro 


Thr 


Ala 


Gly 


Asn 


Ser 


Cys 


Leu 


vAi 


Ser 


Gly 


Trp 


Gly 


Leu 


Leu 


Ala 


Asn 


Gly 


Glu 


Leu 


Thr 


Gly Vai 


Cys 


/Leu 


Pro 


Ser 


Ser 


Arg 


Arg 


Ser 


Ser 


Ala 


Gin 


Ser 


Arg 
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Gly Leu Thr Gin Ser Ser Ala Ser Gin Ala Glu Cys Leu Pro Cys Cys 
Ser Ala / 

SEQ ID NO: 18 / 

Met Val Glu Ala Ser Leu Ser Val Arg His Pro/ Glu Tyr Asn Arg Pro 
Leu Leu Ala Asn Asp Leu Met Leu lie Lys Lert Asp Glu Ser Val Ser 

Glu Ser Asp Thr lie Arg Ser lie Ser lie Ala Ser Gin Cys Pro Thr 

Ala Gly Asn Ser Cys Leu Val Ser Gly Trp Gly Leu Leu Ala Asn Gly 

Glu Leu Thr Gly Val Cys Leu Pro Ser Ser Arg Arg Ser Ser Ala Gin 

Ser Arg Gly Leu Thr Gin Ser Ser Ala Ser /Gin Ala Glu Cys Leu Pro 

Cys Cys Ser Ala / 



SEQ ID NO: 19 / 

Met Val Glu Ala Ser Leu Ser Val Arg His Pro Glu Tyr Asn Arg Pro 

Leu Leu Ala Asn Asp Leu Met Leu life Lys Leu Asp Glu Ser Val Ser 

Glu Ser Asp Thr lie Arg Ser He Sfer He Ala Ser Gin Cys Pro Thr 

Ala Gly Asn Ser Cys Leu Val Ser Gly Trp Gly Leu Leu Ala Asn Gly 

Arg Met Pro Thr Val Leu Gin Cys/ Val Asn Val Ser Val . Val Ser Glu 
Glu Val Cys Ser Lys / 

SEQ ID NO: 20 / 

Met Val Glu Ala Ser Leu Ser /Val Arg His Pro Glu Tyr Asn Arg Pro 

Leu Leu Ala Asn Asp Leu Met Leu He Lys Leu Asp Glu Ser Val Ser 

Glu Ser Asp Thr He Arg SeTr He Ser He Ala Ser Gin Cys Pro Thr 

Ala Gly Asn Ser Cys Leu Val Ser Gly Trp Gly Leu Leu Ala Asn Gly 

Arg Met Pro Thr Val Leu /Gin Cys Val Asn Val Ser Val Val Ser Glu 

Glu Val Cys Ser Lys / 
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SEQ ID NO: 21 




Ala He Ser Ser Gin Val Phe Trp Gly Yal Lys Ser Tyr Lys Arg He 
Phe Cys Trp Val Glu Val Thr Ala Leu /Arg Pro His Ala Val Gly Arg 
Ser Ala Ala Phe Leu Cys Val Cys Val' Ser Ala Ser Pro Met Ser Val 
Ser Val Ser Leu Phe Leu Cys Phe Ser Leu Phe Leu Cys Leu Phe Pro 
Cys Phe Ser Gin Phe Leu Ser Leu yVal Val Thr Val Ser Leu Cys Val 
Ser Pro Ser Leu His Leu Ala Mety Arg Pro Cys Val 'Ser Leu Ser Pro 
Pro Ser Pro Pro Phe Pro Glu Sex Pro Ala Leu Pro Phe Pro Leu Ser 
His Val Ala Gly Val Leu Leu Val Leu Leu Ser Ala Gly Ala Glu His 
Ala Gly Val / 



